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Summary 

The frame -based knowledge representation model adopted in IDHS (Intelligent Dictionary Help 
System) is described in this paper. It is used to represent the lexical knowledge acquired 
automatically from a conventional dictionary. Moreover, the enrichment processes that have been 
performed on the Dictionary Knowledge Base and the dynamic exploitation of this knowledge 
— both based on the exploitation of the properties of lexical semantic relations — are also 
described. 
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1 INTRODUCTION. 

IDHS (Intelligent Dictionary Help System) is 
conceived as a monolingual (explanatory) dictionary 
system for human use (Artola & Evrard, 92). The fact 
that it is intended for people instead of automatic 
processing distinguishes it from other systems dealing 
with semantic knowledge acquisition from 
conventional dictionaries. The system provides various 
access possibilities to the data, allowing to deduce 
implicit knowledge from the explicit dictionary 
information. IDHS deals with reasoning mechanisms 
analogous to those used by humans when they consult 
a dictionary. User level functionality of the system has 
been defined and is partially implemented. 

The starting point of IDHS is a Dictionary Database 
(DDB) built from an ordinary French dictionary. 
Meaning definitions have been analysed using 
linguistic information from the DDB itself and 
interpreted to be structured as a Dictionary Knowledge 
Base (DKB). As a result of the parsing, different 
lexical- semantic relations between word senses are 
established by means of semantic rules (attached to the 
patterns); this rules are used for the initial construction 
of the DKB. 

This paper describes the knowledge representation 
model adopted in IDHS to represent the lexical 
knowledge acquired from the source dictionary. Once 
the acquisition process has been performed and the 
DKB built, some enrichment processes have been 
executed on the DKB in order to enhance its 
knowledge about the words in the language. Besides, 
the dynamic exploitation of this knowledge is made 
possible by means of specially conceived deduction 
mechanisms. Both the enrichment processes and the 
dynamic deduction mechanisms are based on the 
exploitation of the properties of the lexical semantic 
relations represented in the DKB. 

In the following section an overview of IDHS is 
given. Section 3 briefly presents the process of 
construction of the DKB. The knowledge 
representation model and the enrichment mechanisms 
are fully described in sections 4 and 5. Section 6 
describes some inferential aspects of the system. 
Finally, in section 7, some figures about the size of the 
prototype built are presented. 

2 THE IDHS DICTIONARY SYSTEM. 

IDHS is a dictionary help system intended to assist a 
human user in language comprehension or production 
tasks. The system provides a set of functions that have 



been inspired by the different reasoning processes a 
human user performs when consulting a conventional 
dictionary, such as definition queries, search of 
alternative definitions, differences, relations and 
analogies between concepts, thesaurus-like word 
search, verification of concept properties and 
interconceptual relationships, etc. (Arregi etal. , 91). 

IDHS can be seen as a repository of dictionary 
knowledge apt to be accessed and exploited in several 
ways. The system has been implemented on a 
symbolic architecture machine using KEE knowledge 
engineering environment. 

Two phases are distinguished in the construction of 
the DKB. Firstly, information contained in the DDB is 
used to produce an initial DKB. General information 
about the entries obtained from the DDB (POS, usage, 
examples, etc.) is conventionally represented 
— attribute-value pairs in the frame structure — while 
the semantic component of the dictionary, i.e. the 
definition sentences, has been analysed and 
represented as an interrelated set of concepts. In this 
stage the relations established between concepts could 
still be, in some cases, of lexical-syntactic nature. In a 
second phase, the semantic knowledge acquisition 
process is completed using for that the relations 
established in the initial DKB. The purpose of this 
phase is to perform lexical and syntactical 
disambiguation, showing that semantic knowledge 
about hierarchical relations between concepts can be 
determinant for this. 

3 BUILDING THE DICTIONARY KNOWLEDGE 
BASE. 

The starting point of this system is a small 
monolingual French dictionary (Le Plus Petit 
Larousse, Paris: Librairie Larousse, 1980) consisting 
of nearly 23,000 senses related to almost 16,000 
entries. The dictionary was recorded in a relational 
database: the Dictionary Database (DDB). This DDB 
is the basis of every empirical study that has been 
developed in order to design the final model proposed 
for representation and intelligent exploitation of the 
dictionary. 

The definition sentences have been analysed in the 
process of transformation of the data contained in the 
DDB to produce the DKB. The analysis mechanism 
used is based on hierarchies of phrasal patterns 
(Alshawi, 89). The semantic structure associated to 
each analysis pattern is expressed by means of a 
Semantic Structure Construction Rule (SSCR). The 
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process of construction of the DKB is automatic and 
based on these SSCR's (Artola, 93). 

The interconceptual lexical-semantic relations 
detected from the analysis of the source dictionary are 
classified into paradigmatic and syntagmatic. Among 
the paradigmatic relations, the following have been 
found: synonymy and antonymy, taxonomic relations 
as hypemymy/hyponymy — obtained from definitions 
of type "genus et differentia" — , and taxonymy itself 
(expressed by means of specific relators such as sorte 
de and espece de), meronymy, and others as gradation 
(for adjectives and verbs), equivalence (between 
adjectives and past participle), factitive and reflexive 
(for verbs), lack and reference (to the previous sense). 
Whereas among the syntagmatic relations, i.e. those 
that relate concepts belonging to different POS's, 
derivation is the most important, but also relationships 
between concepts without any morphological relation 
as case relations, attributive (for verbs), lack and 
conformity have been detected. 

The hierarchies created have already been used to 
parse all the noun, verb, and adjective definitions in 
the DDB. The hierarchy devoted to analyze noun 
definitions is formed with 65 patterns, 49 different 
patterns have been defined to analyze verb definitions, 
and 45 for adjectives. Although it is a partial parsing 
procedure, 57.76% of noun definitions, 79.8% of verbs 
and 69.04% of those corresponding to adjectives have 
been totally "caught" in this application. However, 
with this technique of partial parsing, the parse is 
considered successful when an initial phrase structure 
is recognized, which in general contains the genus or 
superordinate of the defined sense. This is not so for 
the case of lexicographic meta-language constructions 
(specific relators), whose corresponding semantic 
structure is built in a specific way and which deserve 
also specific patterns in the hierarchies. 



4 REPRESENTATION OF THE DICTIONARY 
KNOWLEDGE: THE DKB. 

As we have just seen, the knowledge representation 
scheme chosen for the DKB of IDHS is composed of 
three elements, each of them structured as a different 
knowledge base: 

• KB-THESAURUS is the representation of the 
dictionary as a semantic network of frames, where 
each frame represents a one-word concept (word 
sense) or a phrasal concept. Phrasal concepts 
represent phrase structures associated to the 
occurrence of concepts in meaning definitions. 
Frames — or units — are interrelated by slots 
representing lexical-semantic relations such as 
synonymy, taxonomic relations (hypernymy, 
hyponymy, and taxonymy itself), meronymic 
relations (part-of, element-of, set-of, member-of), 
specific relations realised by means of meta- 
linguistic relators, casuals, etc. Other slots contain 
phrasal, meta-linguistic, and general information. 

• KB-DICTIONARY allows access from the 
dictionary word level to the corresponding concept 
level in the DKB. Units in this knowledge base 
represent the entries (words) of the dictionary and 



are directly linked to their corresponding senses in 
KB-THESAURUS. 

• KB-STRUCTURES contains meta-knowledge 
about concepts and relations in KB-DICTIONARY 
and KB-THESAURUS: all the different structures 
in the DKB are defined here specifying the 
corresponding slots and describing the slots by 
means of facets that specify their value ranges, 
inheritance modes, etc. Units in KB-THESAURUS 
and KB-DICTIONARY are subclasses or instances 
of classes defined in KB-STRUCTURES. 

Fig. 1 gives a partial view of the three knowledge 
bases which form the DKB with their correspondent 
units and their inter/intra relationships. 

In the KB-THESAURUS, some of the links 
representing lexical-semantic relations are created 
when building the initial version of the knowledge 
base, while others are deduced later by means of 
specially conceived deduction mechanisms. 

When a dictionary entry like spatule 1 1: sorte de 
cuiller plate (a kind of flat spoon) is treated, new 
concept units are created in KB-THESAURUS (and 
subsidiarily in KB-DICTIONARY) and linked to 
others previously included in it. Due to the effect of 
these links new values for some properties are 
propagated through the resulting taxonomy. 
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Fig. 1.- The Dictionary Knowledge Base. 

SUBCLASS link 

- - - MEMBER-OF link (instance) 

(1) Taxonomic Relation: HYPERNYM/HYPONYM 

(2) Specific (meta-linguistic) relation: SORTE-DE /SORTE-DE-^INV 
(KIND-OF/KIND-OF-HINV) 

(3) CARACTERISTIQUE/CARACTERISTIQUE-HlNV 
(PROPERTY/PROPERTY-flNV) relation 

(4) MOTS-ENTREE /SENS (ENTRY- WORD / WORD-SENSE) relation 

In the example, although it is not explicit in the 
definition, spatule is "a kind of ustensile and so it will 
inherit some of its characteristics (depending upon the 
inheritance role of each attribute). Fig. 1 also shows 
the types of concepts used: spatule 1 1 and cuiller 1 1 
are noun definitions and considered subclasses of 
ENTITIES while plat 1 1 (an adjective) is a subclass of 



QUALITIES. The phrasal concept unit representing 
the noun phrase cuiller plate is treated as a hyponym 
of its nuclear concept {cuiller 1 1). 

4,1 KB-STRUCTURES: the meta-knowledge. 

This knowledge base reflects the hierarchical 
organisation of the knowledge included in the DKB. 

We will focus on the LKB-STRUCTURES class 
which defines the data types used in KB- 
DICTIONARY and KB-THESAURUS, and that 
organises the units belonging to these knowledge bases 
into a taxonomy. 

Slots defined in KB-STRUCTURES have 
associated aspects such as the value class, the 
inheritance role determining how values in children's 
slots are calculated, and so on. Each lexical-semantic 
relation — represented by an attribute or slot — has its 
own inheritance role. For instance, the inheritance role 
of the CARACTERISTIQUE relation states that every 
concept inherits the union of the values of the 
hypemyms for that relation, while the role defined for 
the SYNONYMES relation inhibits value inheritance 
from a concept to its hyponyms. 

The subclasses defined under LKB-STRUCTURES 

are the following: 

• ENTRIES, that groups dictionary entries belonging 
to KB-DICTIONARY; 

• DEFINITIONS, that groups word senses classified 
according to their POS; 

• REFERENCES, concepts created in KB- 
THESAURUS due to their occurrence in definitions 
of other concepts ("definitionless"); 

• CONCEPTS, that groups, under a conceptual point 
of view, word senses and other conceptual units of 
KB-THESAURUS. 

The classification of conceptual units under this last 

class is as follows: 

• TYPE-CONCEPTS correspond to Quillian's 
(1968) "type nodes"; this class is, in fact, like a 
superclass under which every concept of KB- 
THESAURUS is placed. It is further subdivided in 
the classes ENTITIES, ACTIONS/EVENTS, 
QUALITIES and STATES, that classify different 
types of concepts. 

• PHRASAL-CONCEPTS is a class that includes 
concepts similar to Quillian's "tokens" 
— occurrences of type concepts in the definition 
sentences — . Phrasal concepts are the representation 
of phrase structures which are composed by several 
concepts with semantic content. A phrasal concept 
is always built as a subclass of the class which 
represents its head (the noun of a noun phrase, the 
verb of a verb phrase, and so on), and integrated in 
the conceptual taxonomy. Phrasal concepts are 
classified into NOMINALS, VERBALS, 
ADJECTIVALS, and ADVERBIALS. 

For instance, Iplante I 1#3I is a phrasal concept (see 
Fig. 2), subclass of the type concept Iplante III, and 
represents the noun phrase "une plante d'omement" . 



• Finally, the concepts that, after the analysis phase, 
are not yet completely disambiguated (lexical 
ambiguity), are placed under the class 
AMBIGUOUS-CONCEPTS, which is further 
subdivided into the subclasses HOMOGRAPHE 
(e.g. Ifaculte ? ?l), SENSE (Ipanser I ?l), and 
COMPLEX (Idonner I 5/61), in order to distinguish 
them according to the level of ambiguity they 
present. 

The links between units in KB-THESAURUS and 
KB-DICTIONARY are implemented by means of slots 
tagged with the name of the link they represent. These 
slots are defined in the different classes of KB- 
STRUCTURES. 

The representation model used in the system is 
made up of two levels: 

• Definitory level, where the surface representation of 
the definition of each sense is made. 
Morphosyntactic features like verb mode, time, 
determination, etc. are represented by means of 
facets attached to the attributes. The definitory level 
is implemented using representational attributes. 
Examples of this kind of attributes are: DEE- 
SORTED, D E F-QUI, CARACTERISTIQUE and 
AVEC. 

• Relational level, that reflects the relational view of 
the lexicon. It supports the deductive behaviour of 

the system and is made up by means of relational 
attributes, that may eventually contain deduced 
knowledge. These attributes, defined in the class 
TYPE-CONCEPTS, are the implementation of the 
interconceptual relations: ANTONYMES, AGENT, 
CARACTERISTIQUE, SORTE-IK, CE-QUI, etc. 

4.2 KB-DICTIONARY: from words to concepts. 

This knowledge base represents the links between each 
dictionary entry and its senses (see link 4 in Fig. 1). 

4.3 KB-THESAURUS: the concept network. 

KB-THESAURUS stores the concept network that is 
implemented as a network of frames. Each node in the 
net is a frame that represents a conceptual unit: one- 

word concepts and phrasal concepts. 

The arcs interconnect the concepts and represent 
lexical- semantic relations; they are implemented by 
means of frame slots containing pointers to other 
concepts. Hypernym and hyponym relations have been 
made explicit, making up a concept taxonomy. These 
taxonomic relations have been implemented using the 
environment hierarchical relationship, in order to get 
inheritance automatically. 

Let us show an example. The representation of the 
following definition 

geranium 1 1: une plante d'ornement 

requires the creation of two new conceptual units in 
THESAURUS: the one which corresponds to the 
definiendum and the phrasal concept which represents 
the noun phrase of the definition. Moreover, the units 



which represent plante and omement are to be created 
also (if they have not been previously created because 
their occurrence in another definition). 

Let us suppose that three new units are created: 
Igeranium III, Iplante 1 1#3I and lomementi II. 

Attributes in the units may contain facets (attributes 
for the attributes) used in the definitory level to record 
aspects Uke determination, genre and so on, but also to 
establish the relations between definitory attributes 
with their corresponding relational, or to specify the 
certainty that the value in a representational attribute 
has to be "promoted" to a corresponding relational (see 
below the case of the slot DE in Iplante 1 1#3I). 

Following is given the composition of the frames of 
these three units at the definitory level of 
representation (slots are in small capitals whereas facet 
identifiers are in itaUcs): 

Igeranium I II 

MEMBER. CF : NOMS 

GROUPE-CATEGORIEL : NOM 
CLASSE-ATTRIBUT: INFO-GENERALE 

TEXTE-DEFiNmoN: "unc plante d'omement" 

CLASSE-ATTRIBUT: INFO-GENERALE 

def-classique: Iplante 1 1#3I 

CLASSE-ATTRIBUT: DEFINITOIRES 
DETERMINATION: UN 
GENRE: F 

RELATIONNELS-CORRESPONDANTS: DEFINI-PAR 

Iplante 1 1#3I 

SUBCLASS. CF : Iplante 1 1 1 
MEMBER. CF : NOMINALES 
texte: "plante d'omement" 

CLASSE-ATTRIBUT: INFO-GENERALE 
DE: lomement I 1 1 

CLASSE-ATTRIBUT: SYNTACMATIQUES 

RELATIONNELS-CORRESPONDANTS: ORIGINE, POSSESSEUR, 
MATIERE, OBJECTIF 
OBJECTIF: 0.9 

lornement I II 

MEMBER. CF: REFERENCES 

Before showing the representation of these units at 
the relational level, it has to be said that after the initial 
DKB has been built some deductive procedures have 
been executed: e.g. deduction of inverse relationships, 
taxonomy formation, etc. It is to say that in Fig. 2, 
where the relational view is presented, the relations 
deduced by these procedures are also represented. 

The conceptual units in THESAURUS are placed in 
two layers (see Fig. 2), recalling the two planes of 
QuilUan. The upper layer corresponds to type concepts 
whereas in the lower phrasal concepts are placed. 
Every phrasal concept is placed in the taxonomy 
directly depending from its nuclear concept, as a 
hyponym of it. 

It is interesting to notice in the figure the relation of 
conceptual equivalence established between 
Igeranium I II and Iplante I 1#3I (link labelled (3)). 
These units represent, in fact, the same concept 
because Iplante I 1#3I, standing for "une plante 
d'omement", is the definition of Igeranium III. 
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Fig. 2. Relational view of the concept Igeranium I II 
(in the THESAURUS net). 

The frame of Igeranium I II at the relational level of 
representation takes the following aspect, once the 
relational attributes have been (partially) completed: 

Igeranium I II 

SUBCLASS. CF: ENTITES, Iplante I II 

MEMBER. CF : NOMS 

GROUPE- CATEGORIEL : NOM 
CLASSE-ATTRIBUT: INFO-GENERALE 

TEXTE- DEFiNmoN: "une plante d'omement" 

CLASSE-ATTRIBUT: INFO-GENERALE 

DEF-CLASSlQUE:lplanteI 1#3I 

CLASSE-ATTRIBUT: DEFINITOIRES 
DETERMINATION: UN 
GENRE: F 

RELATIONNELS-CORRESPONDANTS: DEFINI-PAR 

defini-par: Iplante II #3 1 

CLASSE-ATTRIBUT: RELATIONNELS 
INVERSES-CORRESPONDANTS:DEFINrTION-DE 

OBJECTIF: lomement I II 

CLASSE-ATTRIBUT: RELATIONNELS 
INVERSES-CORRESPONDANTS: OBJECTIF+INV 

Let us show now another example. It is the case of 
two definitions stated by means of two different 
stereotyped formulae belonging to the lexicographic 
meta-language. Many verbs in the LPPL are defined 
by means of a formula beginning with "rendre " and 
many nouns with one beginning with "qui". The 
definitions selected for this example correspond to the 
entries publier 1 1 and ajusteur 1 1 , which are 
represented at the definitory level using the meta- 
language attributes DEF-RENDRE and DEF-QUI 
respectively: 

publier 1 1: rendre public 

ajusteur 1 1: qui ajuste des pieces de metal 

The frame corresponding to Ipublier I II is the 

following: 

Ipublier I II 

MEMBER. CF: VERBES 

GROUPE- CATEGORIEL : VERBE 
CLASSE-ATTRIBUT: INFO-GENERALE 

texte- DEFINITION : "rendre public" 

CLASSE-ATTRIBUT: INFO-GENERALE 

DEF- rendre: I public I II 

CLASSE-ATTRIBUT: DEFINITOIRES 
RELATIONNELS-CORRESPONDANTS: RENDRE 

where it can be seen that no phrasal concept is 
involved because the link (DEF-RENDRE) is 
established directly between Ipublier I II and 
Ipublic III. However, in the case of the definition of 
ajusteur I 1, two phrasal concepts are created: the 
attribute DEF-QUI points to the phrasal concept lajuster 



I l#ll, representing "ajuster des pieces de metal", and 
this phrasal concept, in turn, has a syntagmatic 
attribute (OB JET) pointing to a nominal that represents 
"piece de metal". Let us show the frames involved in 
this last case: 

lajusteur I II 

MEMBER. CF: NOMS 

GROUPE-CATEGORIEL : NOM 

CLASSE-ATTRIBUT: INFO-CENERALE 

TEXTE- DEFINITION : "qui ajuste des pieces de metal" 

CLASSE-ATTRIBUT: INFO-CENERALE 

DEE- qui: I ajuster I 1#1 1 

CLASSE-ATTRIBUT: DEFINITOIRES 
MODE: IND 
ASPECT: NT 
TEMPS: PRES 
PERSONNE: 3 

RELATIONNELS-CORRESPONDANTS: QUI 

lajuster I l#ll 

SUBCLASS. CF: lajusterl 1 1 
MEMBER. CF : VERBALES 
TEXTE: "ajuster des pieces de metal" 

CLASSE-ATTRIBUT: INFO-GENERALE 

OB JET: Ipiece I 1#2I 

CLASSE-ATTRIBUT: SYNTAGMATIQUES 
DETERMINATION: UN 
NOMBRE: PL 

RELATIONNELS-CORRESPONDANTS: THEME 

Ipiece 1 1#2I 

SUBCLASS. of: Ipiece I 1 1 
MEMBER. CF: NOMINALES 
TEXTE: "piece de metal " 

CLASSE-ATTRIBUT: INFO-GENERALE 

de: Imetall 1! 

CLASSE-ATTRIBUT: SYNTAGMATIQUES 

RELATIONNELS-CORRESPONDANTS: ORIGINS, POSSESSEUR, 
MATIERE, OBJECTIF 
MATIERE: 0.9 

Frequently, phrasal concepts represent "unlabelled" 
concepts, i.e., they indeed represent concepts that do 
not have a significant in the language. For instance, 
there is not, at least in French, a verbal concept 
meaning 'ajuster des pieces de metal' nor a noun 
meaning 'piece de metal'. However, this is not the case 
of the phrasal concepts that are linked to type concepts 
by means of the relation DEFINI-PAR /DEFINITION -DE, 
because there, the phrasal concept is, in fact, another 
representation of the concept being defined (see above 
the example of the definition of geranium II). In the 
representation model proposed in this work, phrasal 
concepts denote concepts that are typically expressed 
in a periphrastic way and that do not have necessarily 
any corresponding entry in the dictionary ^ . 

Another interesting point related to the creation of 
these phrasal concepts is the maintenance of direct 
links between a concept and all the occurrences of this 
concept in the definition sentences of other concepts. It 
gives, in fact, a virtual set of usage examples that may 
be useful for different functions of the final system. 



This could be very interesting also, in the opinion of the authors, in a 
multilingual environment: it is possible that, in another language, the concept 
equivalent to that which has been represented by the phrasal concept 
Ipiece I 1#2I has its own significant, a word that denotes it. In this case, the 
phrasal concept based representation may be useful to represent the 
equivalence between both concepts. 



5 ENRICHMENT PROCESSES PERFORMED 
ON THE DKB. 

In this section the enrichment processes accomplished 
on the DKB are explained. Two phases are 
distinguished: (a) the enrichment obtained during the 
construction of the initial DKB, and (b), where 
different tasks concerning mainly the exploitation of 
the properties of synonymy and taxonymy have been 
performed. 

5.1 Enrichment obtained during the construction of 
the initial DKB. 

KB-THESAURUS itself, represented —as a 
network — at the relational level, can be considered an 
enrichment of the definitory level because, while the 
DKB was built, the following processes have been 
performed: 

• Values coming from the definitory level have been 
promoted to the relational level. 

• Values coming from the unit which represents the 
definiens have been transferred to the 
corresponding definiendum unit. 

• The maintenance of the relations in both directions 
has been automatically guaranteed. 

• The concepts included in REFERENCES have been 
directly related to other concepts. 

• The taxonomy of concepts has been made explicit, 
thus obtaining value inheritance. 

5.2 Second phase in the enriclunent of the DKB. 

Several processes have been carried out in order to 
infer new facts to be asserted in the DKB^. The 
enrichment obtained in this phase concerns the two 
following aspects: 

• Exploitation of the properties of the synonymy 
(symmetric and transitive). 

• Enlargement of the concept taxonomy based on 
synonymy. 

Another aspect that has been considered to be 
exploited in this phase is that of disambiguation. The 
use of the lexical- semantic knowledge about 
hierarchical relations contained in the DKB can be 
determinant in order to reduce the level of lexical and 
syntactical ambiguity^. Heuristics based on the 
taxonomic and synonymic knowledge obtained 
previously have been considered in this phase. Some 
of them have been designed, implemented and 
evaluated in a sample of the DKB. 

6 INFERENTIAL ASPECTS: DYNAMIC 
DEDUCTION OF KNOWLEDGE. 

Dynamic acquisition of knowledge deals with the 
knowledge not explicitly represented in the DKB and 



By means of rules fired following a forward chaining strategy. 
3 Lexical ambiguity comes from the definitions themselves; syntactical 
ambiguity is due mainly to the analysis process. 



captured by means of especially conceived 
mechanisms which are activated when the system is to 
answer a question posed by the user (Arregi et al. , 91). 
The following aspects are considered: 

• Inheritance (concept taxonomy). 

• Composition of lexical relations. 

• Links between concepts and relations: users are 
allowed to use actual concepts to denote 
relationships (and not only primitive relations). 

• Ambiguity in the DKB: treatment of remaining 
uncertainty. 

In the following, some aspects concerning to the 
second point will be discussed. 

In IDHS, the relationships among the different 
lexical-semantic relations can be easily expressed in a 
declarative way. It is the way of expressing these 
relationships that is called the composition of lexical 
relations. From an operative point of view, this 
mechanism permits the dynamic exploitation — under 
the user's requests — of the properties of the lexical 
relations in a direct manner. It is, in fact, a way of 
acquiring implicit knowledge from the DKB. 

The declarative aspect of the mechanism is based 
on the definition of triples: each triple expresses a 
relationship among different lexical- semantic relations. 
These triples have the form (Ri R2 R3), where Ri 
represents a lexical relation^. The operative effect of 
these declarations is the dynamic creation of 
transitivity rules based on the triples stated. The 
general form of these rules is the following: 

if X Ri Y and Y R2 Z then X R3 Z 

When the value(s) of the attribute R3 are asked, a 
reading demon (attached to the attribute) creates the 
rule and fires the reasoning process with a backward - 
chaining strategy. The deduced facts, if any, will not 
be asserted in the DKB, but in a temporary context. 

For instance, the problem of transitivity in 
meronymic relations (Cruse, 86; Winston et al. , 87) 
can be easily expressed by stating the triple (PARTIE- 
DE PARTIE-DE PARTIE-IK) but not Stating, for 
instance, (PARTIE-DE MEMBRE -DE PARTTE-IX), thus 
expressing that the transitivity in the second case is not 
true. Examples of other triples that have been stated in 
the system are: 

• Combination of meronymic and non-meronymic 
relations: 

(PARTIE-DE LOCATIF LOCATIF) 
(LOCATIF HYPERONYME LOCATIF) 
(MEMBRE-DE HYPERONYME MEMBRE-IK) 

• Combination of relations derived from the 
definition meta-language: 

(CARACTERISTIQUE QUI -A POSSESSION) 
(OBJECTIF CE-QUI OBJECTIF) 

Explicit rules of lexical composition can be used 
when the general form of the triples is not vaUd. These 
rules are used following the same reasoning strategy. 



Following is given the rule derived from the last 
triple and one instance of it. By means of this rule 
instance, the fact that the purpose of a geranium is the 
action of orner is deduced from the definitions of 
geranium and omement: 

if X OBJECTIF Y and ;;; the objective of X is Y (entity) 

YcE-QuiZ ;;; Y "est ce qui" Z (action) 

then X OBJECTIF Z ;;; the objective of X is Z (action) 

if Igeranium I II objectif lomementi II and 

lomement I II OE-qui lomer I II 
then Igeranium I II objecitf lomer I II 



7 THE PROTOTYPE OF IDHS: SIZE OF THE 
DKB. 

Following some figures are given in order to show 
the size of the prototype obtained after the initial 
construction of the DKB. This prototype contains an 
important subset of the source dictionary. 

KB-DICTIONNAIRE contains 2400 entries, each 
one representing one word. KB-THESAURUS 
contains 6130 conceptual units; 1738 units of these are 
phrasal concepts. In this KB there are 1255 ambiguous 
concepts. Once the initial construction phase was 
finished, 19691 relational arcs — interconceptual 
relationships — had been estabUshed. 

After the enrichment processes, the number of 
relational links have been incremented up to 21800 
(10.7%). It has been estimated that, using the 
mechanism of lexical composition, the number 
interconceptual relations could reach an increment of 
between 5 and 10%^. 

8 CONCLUSIONS. 

A frame-based knowledge representation model has 
been described. This model has been used in an 
Intelligent Dictionary Help System to represent the 
lexical knowledge acquired automatically from a 
conventional dictionary. 

The characterisation of the different interconceptual 
lexical-semantic relations is the basis for the proposed 
model and it has been established as a result of the 
analysis process carried out on dictionary definitions. 

Several enrichment processes have been performed 
on the DKB — after the initial construction — in order 
to add new facts to it; these processes are based on the 
exploitation of the properties of lexical-semantic 
relations. Moreover, a mechanism for acquiring — in a 
dynamic way — knowledge not exphcitly represented 
in the DKB is proposed. This mechanism is based on 
the composition of lexical relations. 



The result of the transitivity rule that will be created will be the deduction of 
values for the R3 attribute. The triples are stored in a facet of R3. 



Considering only the set of triples declared until now. 



REFERENCES. 

Alshawi, H. (1989). Analysing dictionary definitions 

in B. Boguraev, T. Briscoe eds., pp. 153-169, 

Computational Lexicography for Natural Language 

Processing. New York: Longman. 

Amsler, R.A. (1981). A. Taxonomy for English Nouns 

and Verbs, Proc. 19th Annual Meeting ACL, pp. 133- 

138. 

Arango Gaviria, G. (1983). Une approche pour 
amorcer le processus de comprehension et d'utilisation 
du sens des mots en langage naturel. These de 3eme 
cycle (Paris VI). Publications du Groupe de Recherche 
Claude Frangois Picard. 

Arregi X., X. Artola, A. Diaz de llarraza, F. Evrard, K. 
Sarasola (1991). Aproximacion funcional a DIAC: 
Diccionario inteligente de ayuda a la comprension, 
Proc. SEPLN, 11, pp. 127-138. 

Artola X., F. Evrard. (1992). Dictionnaire intelligent 
d'aide a la comprehension, Actas IV Congreso Int. 
EURALEX'90 (Benalmadena), pp. 45-57. Barcelona: 
Biblograph. 

Artola X. (1993). "fflZTSUA: Hiztegi-sistema urgazle 
adimendunaren sorkuntza eta eraikuntza / Conception 
d'un systeme intelligent d'aide dictionnariale (SIAD)" 
Ph.D. Thesis. UPV-EHU. 

Boguraev B., T. Briscoe eds. (1989). Computational 
Lexicography for Natural Language Processing. New 
York: Longman. 

Byrd R.J., N. Calzolari, M.S. Chodorow, J.L. Klavans, 

M.S. Neff, O.A. Rizk (1987). Tools and Methods for 

Computational Lexicography, Computational 

Linguistics 13, 3-4, pp. 219-240. 

Calzolari, N. (1984). Machine-readable dictionaries, 

lexical data bases and the lexical system, Proc. 

COLING (Standford Univ.), p. 460. 

Calzolari N., E. Picchi (1988). Acquisition of semantic 

information from an on-line dictionary, Proc. COLING 

(Budapest), pp. 87-92. 

Chodorow M.S., R.J. Byrd (1985). Extracting semantic 
hierarchies from a large on-line dictionary, Proc. ACL, 
pp. 299-304. 

Chouraqui E., E. Godbert (1989). Representation des 
descriptions definies dans un reseau semantique, Actes 
7eme Congres Reconnaissance des Formes et 
Intelligence Artificielle (AFCET-INRIA, Paris), pp. 
855-868. 

Copestake, A. (1990). An approach to building the 
hierarchical element of a lexical knowledge base from 
a machine-readable dictionary, paper read at First Int. 
Workshop on Inheritance in NLP (Tilburg). 
Cruse D.A. (1986). Lexical Semantics. Cambridge: 
Cambridge University Press. 

van den Hurk 1., W. Meijs. The dictionary as a corpus: 
analyzing LDOCE's definition-language. Corpus 
Linguistics II, pp. 99-125. 

Litkowsky K.C. (1978). Models of the semantic 
structure of dictionaries, American Journal of 
Computational Linguistics, Mf. 81, pp. 25-74. 



Markowitz J., T. Ahlswede, M. Evens (1986). 
Semantically significant patterns in dictionary 
definitions, Proc. 24th Annual Meeting ACL (New 
York), pp. 112-119. 

Pazienza M.T., P. Velardi (1987). A structured 
representation of word-senses for semantic analysis, 
Proc. 3rd European Conference ACL (Copenhaguen), 
pp. 249-257. 

Quillian M.R. (1968). Semantic Memory in M. Minsky 
ed., pp. 227-270, Semantic Information Processing. 
Cambridge (Mass.): MTT Press. 

Tsurumaru H., T. Hitaka, S. Yoshida (1986). An 
attempt to automatic thesaurus construction from an 
ordinary Japanese language dictionary, Proc. COLING 
(Bonn), pp. 445-447. 

Vossen P., W. Meijs, M. den Breeder (1989). Meaning 
and structure in dictionary definitions in B. Boguraev, 
T. Briscoe eds., pp. 171-192, Computational 
Lexicography for Natural Language Processing. New 
York: Longman. 

Wilks Y., D. Pass, G. Cheng-Ming, J.E. McDonald, T. 
Plate, B.M. Slator (1990). Providing Machine 
Tractable Dictionary Tools, Machine Translation, no. 
5, pp. 99-154. 

Winston M.E., R. Chaffin, D. Herrmaim (1987). A 
Taxonomy of Part-Whole Relations, Cognitive 
Science, no. 11, pp. All -AAA. 



